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Abstract 

We study the entanglement entropy in a relativistic quantum field theory for regions 

which are not included in a single spatial hyperplane. This geometric configuration cannot 

be treated with the Euclidean time method and the replica trick. Instead, we use a real 

time method to calculate the entropy for a massive free Dirac field in two dimensions 

in some approximations. We find some specifically relativistic features of the entropy. 

First, there is a large enhancement of entanglement due to boosts. As a result, the 

mutual information between relatively boosted regions does not vanish in the limit of zero 

volume and large relative boost. We also find extensivity of the information in a deeply 

Lorentzian regime with large violations of the triangle inequalities for the distances. This 

£>. \ last effect is relevant to an interpretation of the amount of entropy enclosed in the Hawking 

■ radiation emitted by a black hole. 
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1 Introduction 



In the context of quantum field theory (QFT) the term "entanglement entropy" usually refers 
to the entropy S(V) of the vacuum state reduced to a region V of the space. It essentially 
measures the entropy contained in the vacuum fluctuations in this region. The interest in this 
subject arose initially from an interpretation of the black hole entropy in terms of entangle- 
! ment entropy pQ. Later, several different applications have been developed, showing that the 
entanglement entropy of the vacuum contains information on important aspects of the QFT, 
including renormalization flow j2J[3], topological order p[|, phase transitions [5] and confinement 
[6]. Recently, the possibility of using the entanglement entropy as a tool for developing the 
AdS-CFT dictionary has attracted much interest [7]. 

In contrast with the more intuitive idea of the entropy of a substance contained in a box, 
which under normal circumstances persists on time, the entanglement entropy of a region has 
to be denned for a fixed instant of time (see figure (CD)). After this moment, the reduced state 
on the region V will spread out at the velocity of light. In order to impede this spreading, 
some material box imposing boundary conditions would be necessary. But this is not what we 
are willing to do, since this further element, the boundary condition, spoils the very nature of 
the entanglement entropy, of being a quantity depending only on the geometry of V and the 
particular QFT. 
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Figure 1: The entanglement entropy is a function of pieces of spatial surfaces like the set A in 
this figure (light like lines are shown at ±45°). Spatial sets having the same causal domain of 
dependence (diamond shaped set) such as A and A', have the same reduced density matrix. 
In consequence, a regularization independent quantity such as the mutual information I(A, B) 
coincides with the one between equivalent regions I(A',B'). 

Therefore entanglement entropy has to be thought as a quantity localized in space-time 
[8]. In the relativistic case, V can be any <i-dimensional spatial region (in a d + 1-dimensional 
space-time), included as a part of a Cauchy surface, but not necessarily contained in a flat 
spatial hyperplane. This space-time nature of entanglement entropy is fundamental to the 
entropic c-theorem in two dimensions [2] , but has otherwise not received much attention in the 
literatur^j]. 

This may be attributed to the fact that if V is not contained in a single spatial hyperplane 
the usual Euclidean method to compute entanglement entropy based on the replica trick be- 
comes inapplicable in a direct way. A d- dimensional surface in Euclidean space corresponds 
to a d- dimensional spatial surface in Minkowski only if this is a flat surface. Otherwise, the 
Minkowskian result should follow from the Euclidean one through a complicated analytic con- 
tinuation in the space of regions. 

In this paper we analyze the behavior of the entanglement entropy for non coplanar regions 
in the simplest QFT model given by a free fermion in two dimensions. We choose a fermion field 
instead of a scalar one, since this later involves the treatment of more singular kernels [10] , and 
in addition, in the two dimensional case it develops an infrared divergence in the massless limit. 
We use a real time method based on the explicit expression of the reduced density matrix for 
the free case [HI [12]. The entanglement entropy for several disjoint intervals lying on a single 
spatial line was calculated in [12] using this approach and a small mass expansion. We extend 
these techniques here to obtain the general result for non coplanar sets involving relatively 
boosted intervals. 

The entanglement entropy in QFT contains divergences which are proportional to local 
terms on the boundary of V. The regularization independent information can be isolated using 



1 However, effects of boosts on the entanglement of the spin degrees of freedom of relativistic particles have 
been intensely studied. See for example the review papers [5]. 

2 Given two non-overlapping regions several well defined regularization independent "entropic" quantities and 
entanglement measures can be constructed [13) . 
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for two disjoint regions A and B. This quantity has the interpretation of the amount of shared 
information between these regions, and is always positive. It is also increasing with the size of 
A and B, and it gives an upper bound on correlations between these two sets. We calculate this 
function in some specific approximation, and find that this shared information can be greatly 
increased by the relative boosts between A and B. Surprisingly, the boost enhancement is such 
that the mutual information remains no zero for zero volume sets, provided they lie on null 
surfaces. 

An important physical problem which naturally involves the entanglement between regions 
with a large relative boost is the localization of information in the Hawking radiation process, in 
the semiclassical regime. Indeed, the entropy in the Hawking radiation is entanglement entropy 
with the region hidden across the horizon. The entropy in the Hawking radiation contained 
in a finite region of space is rather small and cannot be isolated from the area terms in the 
entanglement entropy, which are regularization dependent [H]. This can be overcome using a 
regularization independent measure of information. A natural choice is I(A,B), being A the 
black hole (or some piece of it near the bifurcation surface) and B a region far outside the black 
hole, containing Hawking radiation. These two regions are related by an exponentially large 
redshift, which arguably may be simulated by a large relative boost in flat space. 

Physical intuition dictates that the information in the radiation region should be spatially 
extensive, at least for regions which are far from the black hole and are large with respect to 
the typical radiation wavelength. This property of extensivity is however non mathematically 
guaranteed because mutual information is in general a non extensive quantity [H| IT5] . The 
deviation from extensivity is measured by the tripartite information 

I(A, B, C) = I(B, A, C) = I(A, C, B) = I {A, B) + I(A, C) - I(A, BC) , (2) 

which is zero in the extensive case I(A, BC) = I (A, B) + I(A, C). Thus, an important question 
related to the entanglement entropy for non coplanar surfaces is whether an extreme Lorentzian 
geometry may guide us to find a general principle making the mutual information extensive. 

We study this question with our toy QFT model. The massless fermion in two dimensions 
has extensive mutual information, but this property fails for massive fields [12] • In the black 
hole interpretation, the presence of a mass in two dimensions connects the ingoing and outgoing 
modes, and simulates the backscattering which occurs for massless fields in more than two 
dimensions. We find here that some special configurations with high relative boosts between 
the different components can restore extensivity in the massive case. We relate this property to 
large violations of the triangle inequalities for the involved distances, which are allowed by the 
Lorentzian geometry. These large violations of the triangle inequalities are also a key ingredient 
of the black hole evaporation geometry. 

2 Entanglement entropy for a Dirac fermion 

In a two dimensional space-time V is a spacelike curve, which can have several connected 
components. We are interested in a free Dirac field *&(x), and write ^(s) for the field ty(x(s)), 
where x(s) is a parameterization of the points of V by the distance parameter s along the curve. 

The general expression of the reduced density matrix py, corresponding to the global vacuum 
state, and the region V, was obtained in [T2] in terms of the field correlator. We will not need 
this expression here, but only the corresponding one to the associated entanglement entropy 
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S(V). This is given by 



son 



-tr 



C)log(l-C) + Clog(C) 



where C is the operator with kernel 



(3) 



(4) 



Here r](s) = 7°7 M ?7 At (s), and rj^s) is the future directed unit vector normal to the curve V at 
the point x(s). We have fj(s) = fj(sy is hermitian and positive, with detfj = 1, fj 
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7^77^(5)7°. We are using a (1,-1) signature for the metric (time-like vectors have positive 
square). 

This result, as well as the explicit expression for the density matrix, follows from the re- 
quirement tr(pvO) = (0|O|0) for any operator O localized in V, together with the very simple 
structure of the vacuum expectation value for the field polynomials in the free case [TU1 [TTj . 
The density matrix and the entropy require regularization. A more rigorous treatment based 
on the KMS condition is in [15] . 

The density matrix represents the vacuum state on the whole algebra of operators localized 
in V. Because of causality, this coincides with the algebra of operators localized in any other spa- 
tial surface having the same domain of dependence as V [12]. For physical, finite quantities, such 
as the mutual information I(A, B) between two regions, this means that I(A, B) = I (A', B') 
for any pairs of surfaces A, A' and B, B', having the same domain of dependence. This is 
illustrated in figure ([Q). 

In two dimensions we have for the field correlator 



(0\*(x) \P%) |0> 



d,[e((x- y) 2 )sgn(x°-y )] 7 V 
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(5) 



where K a (x) is the standard modified Bessel function, 7 M are the Dirac matrices, and \x\ = 
^/—x^x^. The contribution of the first term on the right hand side of (jS]) to the kernel C of eq. 
(jl]) is just half the identity kernel, 1/2 5{s\ — s 2 ), for any curve V. 

In order to do perturbations it is convenient to express ([3]) in terms of the resolvent kernel 
R— (C —1/2 + (3y\ This is 



S{V) 



d/3 tr 



1/2 



(/3-1/2) (R(J3)-R{-P) 



2/3 



+ 1/2 J 



(6) 



3 The small mass expansion 

According to ([3]) the evaluation of the entropy requires the resolution of the kernel (j5]) involving 
Bessel functions. Unfortunately this is not known at present. Here we take advantage of the 
known expressions for the spectral resolution of the massless kernel in order to make a small 
mass expansion for the entropy. This means that we have to consider all the typical distances 
in V to be smaller than m _1 . It is assumed this holds for the rest of the paper, unless otherwise 
noted. 
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The expansion for the correlator reads C — Cq + G\ + C 2 + with 

Co(x,y) = \d»[9((x-yf)sgn(x -y )]ri° -^^jf^7°, (7) 



Ci(*,y) = -— ^ + bg (-L_JiI j j y> , (8) 

a,(s,y) = -^(^-^ + log(^^))(^-^7°, (9) 

and 7^ is the Euler constant. The perturbative expansion in the mass involves non- commuting 
kernels Co and Cj, z > 0. Hence, it is convenient to use the formula for the entropy in terms of 
the resolvent, and expand the resolvent as 

R v {(3) = R° V (P) - ifyfflCiltyiP) - R° V (/3)C 2 R° V (P) + RU^RU^R^) - ... , (10) 

where Ci(x,y) = Ci(x,y)fj(y), and Ro = (Co — 1/2 + A straightforward analysis of the 

possible terms for the expansion in entropy formula shows that the series can be written in 
terms of powers of m and log(m) as [T2J IT?] 

co 2i co 2i 

5 =EE^-EE ^ m2% lo g J ( m ) • ( n ) 

1=0 j=0 1=0 jr'=0 

Massive corrections for the entanglement entropy of free fields in more dimensions are studied 
in [TBI. 



3.1 The massless contribution 

In the massless case the problem factorizes in the two chiralities. The massless correlator Cq 
diagonalizes in the chiral representation for the spinors, that is, the base where 7 3 = 7°7 1 = 
diag(l, —1) is diagonal. In order to see this we parameterize the coordinate differential tangent 
to the curve V as 

(dx 11 ) = (sinha(s), cosha(s)) ds . (12) 
Hence, the unit vector normal to V is 

(rjfj) = (cosh a, — sinh a) , (13) 

and we have in the chiral representation for the Dirac matrices 



l = {o *>)■ (14) 

Using null coordinates 

u± = x°±x\ (15) 

we can write C as 

Co ={ D + (ul,ul)J- (16) 

Here D± are scalar kernels having the same expression 

1 % 1 

D ± ( Ul ,u 2 ) = -8{u 1 -u 2 ) - — r, (17) 

2 2tt [ui — u 2 ) 



Figure 2: The projections of the set V on the null coordinate axis. V has three connected 
components in this example. The labeling of the extreme points of the intervals in V is done in 
increasing order for the null coordinates. Note for example that bf and a± are null coordinates 
corresponding to the same point. 

but different domains, given by the projections V± of V on the null axis. The kernel fTTTT) is 
understood in the principal value regularization. Note that we have used the relations 

du + = e a ds , du_ = —e~ a ds , (18) 

in order to change coordinates and rewrite the operator Cq. In (j4]) it is expressed as a kernel 
in the distance variable, while in (Tl6|) it is understood that D + (u\,u v + ) and D_(u 1 L,u y _) act as 
kernels on the u + and w_ variables respectively. 

Let us call li, l n to the left extreme points of the n different connected components of V 
ordered from left to right in the spatial coordinate, and r±, ...,r n for the right extreme points. 
The projection of V onto the null coordinates is formed by a union of disjoint intervals, which 
we call V± = (af,bf) U ... U (af, bf), with af < af +1 . Hence we have simply af = if and 
bf = rf, while = r~_ i+1 and bj = C-i+i ( see figure (EJ)). The ordering for the extreme 
points in V~ is inverted with respect to the natural left to right ordering because of the sign 
in (USD. 

The spectral decomposition of the scalar kernels D± in arbitrary mult i- component sets is 
known [121 US]- We review the main properties of the kernel (x — y)~ l in the Appendix. Using 
this decomposition, a straightforward calculation of eq. 03), which we do not repeat here, leads 
to the result [12] 

<So,o = $0,0 + $0,0 > (19) 
5 °to = I (5^1og|fof - af| - ^log|a± - a±| -^l og |6f -^logej . (20) 

Here e is a short distance cutoff. It appears in the calculation of the trace in ()3]), where the 
integration is taken from af + e to bf — e in each interval. For a single interval the entropy 
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is proportional to the logarithm of the interval length, and this is a general result for two- 
dimensional conformal theories [20] . 

The mutual information is of course finite. A simple calculation shows it can be written as 



hAA b ) = - 3 j a j b ^ d °B j^bj [ g,* - ^ 

where x A and t\a (repectively x B and tjb) are functions of sa (sb) giving a parameterization of 
the position and the normal along the curve A (B). It can also be written as a sum over the 
two chiralities using ( I20p . In particular for two single interval sets A and B we have 

T (A - 1 In, f (bl-bt)(ai-at) \ 1 f (b 2 - K)(a^ - a^) \ 

l ,o(A, B) - - log — — + - log — — , (22) 

6 V>2 -&i)(&2 - a i)J 6 V(a 2 - & i)( & 2 -Oi)/ 

which is a function of the cross-ratios of the projections of the extreme points on the null axis. 
For more general CFT the equation corresponding to (1221) may contain an additional term 
which is function of the cross ratios [21]. However, unlike the logarithms in ( 122]) this new term 
is a bounded function [2]. 

The expression ( 121]) as a double integral shows immediately that the mutual information is 
extensive in the massless limit, 

I 0i0 (A,B,C) = 0. (23) 

In the following section we compute the first terms in the small mass expansion of the 
entropy. It is then natural to express all the kernels in the chiral base, where the massless one 
is diagonal. In order to do this we will need to write traces of products of operators in the chiral 
representation for the Dirac matrices, and as integrals of kernels acting on the null coordinates. 
This is done with the useful formula 



dsids2-..ds n tr 



A 1 (s 1 ,s 2 )A 2 (s 2 ,s 3 )...A n (s n ,s 1 ) = (24) 



5^ J du qi J du q2 ... J du qn A\ iq2 (u qi ,u q2 ) A 2 q2q3 {u q2 ,u q3 ) ...A™ 



q n qi \ U qn^ u qi, 



qi,q2,-,q n =+,- v v 



In this formula A l (si,s 2 ) = A l (si, s 2 )^(s 2 ) is a kernel in distance coordinates, and A l pq (u p , u q ), 
with p, q = ±, is the (p, q) matrix element of A 1 (without the fj factor), in the chiral represen- 
tation, evaluated at the points Si(u p ) and s 2 (w 9 ). The eq. §M§ follows directly from ([14"]) and 
ffi 



3.2 Massive corrections 

In this section we calculate the leading log terms and 52, i i n the expansion for the entropy, 
using the expansion for the resolvent (ITU]) . We need to expand to this order because S 2 ,i is the 
leading non extensive term. The massless resolvent is diagonal 

K(j})= (K-(P) o \ (25) 



o KM 

Using the spectral decomposition of the Appendix we have 



n f + OO 

i^Gff)(«L4) = E / ds* k ±(ul)m(P,s)y k ±*(ul), (26) 
fc=i J -°° 
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with 

m(/3, s) = ^9 - ^ tanh(vrs)^ . (27) 

The leading massive term in the series is proportional to m 2 log 2 (m) and corresponds to the 
fourth term in ([TO]) . We display with certain detail the calculation of this term in order to 
exemplify the main technical steps. These are essentially the same for the calculation of the 
following terms in the expansion. We have 

+00 

S 2 , 2 = -^log 2 (m)y rf/3(/3-l/2)tr [ J R y (/3)l 7 r / ^(/3)l7°^(/3) 



1/2 



-^(-/3)l 7 °^(-/3)l7°^(-/3) , (28) 



where the kernel l(x,y) = 1 for all x, y. Using the representation (124)) the first term in the 
trace in this last equation writes (the second one follows just replacing ((3 — > —j3)) 



JdxJdyJduJ dwR° v _ (/3) 2 (x, y)R° v+ (/3)(u, w)+ 
v- v- v+ v+ 

+ JdxjdyJduj dwR° v+ {(3) 2 {x,y)R° v -{(3){u,w) . 
v+ v+ v- v- 

This can be further expanded using the spectral decomposition as 

+00 

n „ 

J2 / dsds'm((3 lS ) 2 m(P,s')\A-(s,k)\ 2 \A + (s',k')\ 2 + (+ ^ 

,k'=l „ 



(29) 



(30) 



k 

Here we have written 



A±(s,k) = / dxV^ix). (31) 
Jv± 

Combining eqs. (166|) and (168|) of the Appendix we have the useful property 

n 

|A±(s, fc)| 2 = ^sech 2 (vr S )L ± , (32) 



\A±(s,k)r = 7 ' 

k=l 



with L± = — a i)- Using ( l32l) . the sums in (|30|) are solved, and we end up with integrals 

in s, s' and /3 in ( 1281) . These can be evaluated analytically, leading to 

2 

777 

£2,2 = -— log 2 (m)L+L_. (33) 


This simple quadratic expression also gives an extensive contribution, 

I 2>2 (A,B,C) = 0. (34) 

In order to find the first non-extensive term we calculate the next logarithmic order proportional 
to m 2 log(m). This is produced from the third and fourth terms in (fTUI) (the second term does 
not contribute). Following [T2], we write this contribution as a sum of three terms, 

S 2 ,i = s 2 ,i m 2 log(m) = A! + A 2 + A 3 . (35) 
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We call Ai to the contribution coming from the terms in C\ which does not contain log \x — y\, 
A2 to the one coming from C2, and A3 to the one involving the term proportional to log \x — y\ 
in C\. More explicitly, 



Ai 



m 



2tt 2 



(r B -ln2)log(m)y dfi{p- 1/2) tr R" Tj R u lj" fj - (P <B- ~P) , (36) 
1/2 



im 



47T 



m 
2^2 



log(m) dP(P- 1/2) tr R^df] -/3) 



1/2 
+00 

log(m) J 'dP{P- 1/2) tr 
1/2 



R 02 l-f f)R°O 2 f) - (p -P) 



(37) 



(3? 



where Oi(x, y) — (x — yY 7^7° and Oi{x, y) = log \x — y\ 7 . 

Ai is readily evaluated since the relevant calculation is the same as above, 



Ai 



log(2) - 7£ 



m 2 log(m)L + L. 



(39) 



The contribution of C2 to this order is obtained following the same steps as for the contribution 
5*2,2 • This is, using (124]) to write the trace in the chiral representation and null coordinates, 
then using the spectral decomposition of the massless resolvent, and finally, solving the sums 
and integrals with the help of the formulas in the Appendix. In this case, we have to use (I67j) 
because of the explicit dependence of the correlator contribution in the coordinates. After some 
algebra we get an expression in terms of a one-dimensional integral over V±, 



A, 



m 



log(m) 



du_u> (u) tanh 



du+U- (u-l.) tanh 



LV- 



where we have defined 



z±(u±) 



log 



n*(«± 



(40) 



(41) 



I L 2 m 2 log(m), which is the result 



rii(«±-«4 

For the special case of a set lying in a single spatial hyperplane with total length L 
and choosing the straight line curve, ( 140]) simplifies to A 2 = | L 2 m 2 
presented in [12]. 

Finally, the calculation of A3 starts as above, from (138]) . passing to null coordinates through 
( [24]) , and expanding the resolvent in its spectral decomposition in terms of the eigenvectors ( 126]) . 
At this point, it is convenient in this case to further write the eigenvectors explicitly using ( 1ST]) . 
The sums over the discrete indices of the eigenvectors are done using (169]) . Then, one is left 
with an integral over the eigenvalues s and the resolvent parameter p. These can be done 
(preferably in this order), leading to a delta function in the variable z. The final result is 



A 3 = —2m 2 log(m) J du y _ J du x + log \x — y\ 8 (z {u y _) — z («+)) 



V- v+ 



(42) 



It has to be noted that the results f ]40]) and ( ]42]) depend on the chosen curve within the 
equivalence class of curves having the same domain of dependence as V. The choice of the 
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a 



B C 



b c 



Figure 3: Configuration of two adjacent intervals B and C separated from the interval A by a 
large distance d. All three intervals are collinear. 

curve changes for example the function u + (u_) in (j4"U|) . We know the entropy is independent 
of this choice, but this is reflected in that the whole contribution 5*2,1 is curve-independent, 
while A 2 and A 3 separately are not. We have checked this numerically in several examples. In 
particular it is possible to choose the curve formed by the null future (or past) horizon of the 
domain of dependence of V, which simplifies some calculations. 

For a single interval of length L the integrals can be evaluated analytically and the series 
reads 

crr\ 1 ^ (ri\ m2 i 2, ^ T 2 . log(2) - 7^ + 5/6 2 2 
S{L) = - log(L/e) - — log (m)L + m log(m)L 

-- log(L) log(m) m 2 L 2 + 0{m 2 log°(m)) . (43) 
3 

The entropy for a single interval can also be expressed in terms of a solution of a Painleve 
ordinary differential equation [17] . The expansion for small mass (143)) coincides with the one 
given by this differential equation. 

Of course, for more than one interval the result does not depend only on the total lengths 
L± = ^2(bf — af). In particular, the contributions A 2 and A 3 to S 2i i give place to a non 
extensive mutual information (see Section 4.2 below). In order to see this analytically in a 
special limit, consider three collinear intervals A, B and C, of lengths a, b and c respectively. 
Take the distance from a and b to be d 3> a, b, c, and C adjacent to B on the side opposite to 
A (see figure (jBJ)). In the large d limit we can compute the leading terrrjfl 

I(A, B, C) ~ m 2 log(m)^— {be [-98a + 33(6 + c)] + 120a6clog(a) - 30(2a - b)b 2 log(6) + 

450a 2 

30{c 2 (-2a + c)log(c) + (2a - b - c)(b + c) 2 log(6 + c) - be [8a - 3(b + c)] log(d)}} .(44) 

4 The regime of large relative boosts 

In this section we describe in more detail the behavior of the mutual information in different 
situations which have in common the presence of large relative boosts between the involved 
regions. Specifically, we focus on the geometric configurations shown in figures (jlj) and (J5J). 

4.1 Two boosted intervals 

We consider two sets, which for simplicity we take to be single intervals of lengths a and 6, 
separated by a distance d. The separating interval d can be positioned on the x 1 axis without 
loss of generality. In order to completely fix the configuration we have to specify two more 

3 This equation corrects a mistake in the eq. (32) of [T5] . 
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(a) 



(b) 



Figure 4: Two different limits of vanishing length and divergent relative boost for the intervals 
A and B. 

parameters, the hyperbolic angles a and (3, determining the relative boosts of A and B with 
respect to the separating interval. Thus, the size of the projections to the null axis are a + = ae a , 
a_ = ae~ a , b + = be^ and 6_ = be~^. We are interested in the limit of I(A, B) when a and b 
approach zero. It is easy to see that if a or (3 (or both) are kept bounded while a and b tend to 
zero then I(A, B) also tends to zero. This is a natural result, since it is expected that for a fixed 
distance d the mutual information should vanish with the progressive elimination of degrees of 
freedom in A and B. However, if we take the limit of zero size, but at the same time increase 
the modulus of the boost parameters a and (3 such that the null coordinate projections of A 
and B are kept finite, then the mutual information takes a finite limit value. 

Two different cases have to be considered. These are shown in figure fll]). First the limit in 
which a — > oo and /3 — > oo, while a, b — > 0, in such a way that a + = ae a and b + = be@ are finite. 
In this limit we have a_ = 6_ = (figure fll]) a). The case of vanishing a + and b + (a, (3 — > — oo) 
is analogous. The massless contribution to the entropy does not vanish in this case, and we 
have 



Only the cross ratio of the u + projections matter to this massless contribution. 

Two different ingredients seem to be necessary for an explanation of this result. First, the 
quantum field theory contains infinite many d.o.f. in any finite volume region of any size. This is 
important, otherwise the number of d.o.f. would vanish in the zero volume limiifl Here, it also 
seems important that in the massless limit the theory decomposes into the two chiralities and is 
conformally invariant. Hence, the plus chirality acts as a one dimensional translational invariant 
system independent of the minus chirality, and the finite result for I(A, B) is a consequence of 
the finite values of a+, b + and d + . The physical size of the a and b intervals is of course zero 
in the limit, but in a conformal theory there is the same amount of shared information in any 
rescaled geometric configuration. Thus, the explanation in terms of the conformal limit is that 
a, b and d tend to zero, but only the ratios of their projections in the u + axis are relevant. 

However, such an explanation combining chiral decomposition with invariance under scaling 
is helpless in our next example. The second case we want to consider is a limit a + = 0, 6_ = 0, 
a_ and b + finite (see figure (j4])b) . In this case the conformal contribution vanishes, and the 

4 Infinitcly many d.o.f. in a finite volume is also related to the fact that I(A, B) diverges when A and B come 
into contact for generic A and B. 




(45) 
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leading log term is given by S 2j 2, 



9 

777 

I(A,B)~— log 2 (m)a + b_. (46) 
6 

Again we have a non zero mutual information for zero volume regions. This case is perhaps 
more surprising than the previous one since no interpretation as entanglement between small 
but nearby regions seems possible. It suggests that amplification of entanglement by the large 
relative boost is a genuine effect that can lead to finite shared information for vanishing small 
regions separated by a finite, fixed distance. 

In order to clarify the role of the massless limit in this phenomenon we can look at a 
different limit for the two boosted intervals. This is a large separating distance limit, when 
d 3> a^,a + ,b_,b + and a_,a + ,b_,b + x m <C 1, but not necessarily dm <C 1. Thus, at leading 
order one uses the massless correlator inside the intervals, and considers the correlator between 
points on A and B as a constant. The calculation then involves the same tools used in the 
previous Section. The main steps are explained in [12], and we do not repeat this calculation 
here. We have for two intervals in the limit of large separation 

1 1 

I(A, B) m 2 (a + 6„ + a_6+) K$(md) + - m 2 (a + b + + a-b-)K?(md) . (47) 

6 6 



The first term of the right hand side corresponds to the continuation of fl46|) to a large separating 
distance configuration, while the second term corresponds to the continuation of (122]) . 

Formula ( )47|) clarifies two issues. First, it is clear that the phenomenon of finite mutual 
information between null surfaces regions extends out of the conformal limit, for both config- 
urations in the figure (jlj). Second, since the configurations include large dimensionless boost 
parameters, one could wonder if the perturbative series is behaving correctly in (f46j) . and if it 
converges. This was expected, because the large boost parameters appear in the series only in 
terms of the distances involved and their ratios, which are all finite and bounded. The formula 
(I4T1) extending the result (1461) to a different situation, clarifies this expectation is correct. 

This behavior of the mutual information for null surfaces is connected with related properties 
for the correlation functions. The mutual information is a bound for correlations |22j . 

z W^aW \\Ub\\ 

where Oa and Ob are normal operators in A and B, and ||0|| is the norm of the operator O 
(the greatest eigenvalue modulus). For the fermion field consider the hermitian operator^] 

O a = [ ds(a*(s)V(s) + &(s)a(s)), O b = f ds (ft(s)V(s) + &(s)/3(s)) , (49) 

J A JB 

where a(x) = for x ^ A, and j3(x) = for x ^ B. Since the square of these operators are 
c-numbers, we have for the norms 

WO A f = {0 A f = [ dsa^sms))-^) , (50) 



.4 



5 An objection is that the operator Oa (@H]), being fermionic, is not really localized in A, since it does not 
commute with Ob for example. However, the application of is easily generalized to the fermionic case, and 
it does only require the expectation value of these operators to be given by the corresponding traces involving 
the density matrices. 
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(a) 



(b) 



Figure 5: Two different configurations of three intervals with a large relative boost between A 
and B or C. The case (a) is extensive in the limit of large relative boosts and fixed sizes a, b 
and c while the case (b) does not show extensivity of the mutual information. 

and analogously for B - We also have, (O a) = (Ob) = and 



where a and b are the lengths of A and B, \\fj\\ v is the maximum of the norm of fj(x) for x G V, 
and ||C|| AB = m&x x( zA,y£B \\C(x,y)\\ is the maximum of the absolute value of the eigenvalues 
of correlator between A and B. Then, if the slope of the curve is bounded, \\f)\\ A and \\f}\\ B are 
bounded according to (fl4l) . There is no possibility for ( !52l) but to vanish when a — > or b — > 0. 
This is consistent with I(A, B) — > in these cases. However, if a, b — > but ab \\fj\\ A \\fj\\ B 
remains finite (or diverges) in the limit, one can produce a non zero lower bound for the mutual 
information through (jUJ). This is clearly what happens when one takes a, b — > 0, but keeping 
their projections a^, b^ finite. Hence, in the language of operators our observation is that 
boosts allow for a large relative enhancing of operator correlations as compared to operator 
norm. 

We have chosen to put d horizontal in figure @ without loss of generality. In the case 
of d — > 0, the mutual information is either divergent or ambiguous. This last possibility is 
exemplified by the case a~,b~,d~ — > 0, i.e. A, B and D all tend to lie in a null surface. The 
mutual information ( p2|) depends in this case on the ratios of a~, b~ and d~ as they tend to 
zero. 

4.2 Extensivity of the mutual information 

Now we turn attention to the analysis of the extensivity properties of the mutual information. 

The first thing to note is that in general the relative importance of the non extensivity with 
respect to the mutual information does not necessarily decrease with decreasing correlations. 
For example, in the configuration of three coplanar intervals of figure ()3]), the mutual infor- 
mation is dominated by the massless contribution f )22|) (we take a,b,c d <ti m^ 1 ), while 




dsids 2 (a\s 1 )C(s 1 ,s 2 )f3(s 2 ) - f3\s 1 )C(s 1 ,s 2 )a(s 2 )) ■ 



(51) 



Then, using the Cauchy-Schwartz inequality, we have for the right hand side of ([48]) 




(52) 
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m 2 log(m) l(A,B) 
^ d=100 



1,2- 



0,6- 



0,0 







5 



a 



10 



15 



Figure 6: Relative extensivity of the mutual information for the configuration of figure ©a. 
The interval sizes are a = b = c = 1 . 

I (A, B, C) is given by the leading massive term ( l4*4"j) . Then we have, 



This is increasing with the distance d, at least in the range d <C m . 

A partial understanding for this non-extensivity for the configuration of the figure fl3]) follows 
from the geometry. The typical distances dis(^4, B) ~ dis(A, C) ~ d 3> 1 while dis(-B, C) G 
(0,6 + c). This is a typical Euclidean regime, where the triangle inequalities are satisfied. In 
this case the geometry cannot enforce extensivity for the mutual information. This is because 
while A is correlated similarly with B and C, the correlation between B and C is higher or 
similar to the one they have with A. Thus, A is not entangled independently with B and C, 
but rather there is an important part of the correlations which are truly tripartite, as shown 
by the actual calculation of I (A, B, C) (see ( 15^]) ). 

In this same line of thought, one could expect that if there is a condition enforcing extensivity 
which is general enough to be useful to the applications on black hole physics, the configuration 
has to be largely non-euclidean, with great violations of the triangle inequalities in the distances. 

A possibility is to search for situations where I(B, C) <C I {A, B), I(A, C). This is typically 
the case of the Hawking radiation, with large and uncorrelated asymptotic regions (represented 
by B and C) which are however correlated with the black hole (represented by A). In this case 
the state psc is well approximated by a product state. This is still not equivalent to extensivity 
since in this case {I(B, C) ~ 0) one has only a one sided inequality, 

HAB,C) = I(A,B) + I(B,C)-I(B,AC) I(B,C) 
I(A,B) I(A,B) I(A,B) ■ 1 ] 

Here we have used the monotonicity property of the mutual information, I(B, AC) > I(B, A). 
In the context of quantum entanglement measures a relation like this one, which corresponds 
to E(A, B, C) = E(A, B) + E(A, C) - E(A, BC) < for the entanglement measure E, has 
been called monogamy of entanglement [23]. Some entanglement measures (for example the 
squashed entanglement [21]) always satisfy this inequality. This is not the case of the mutual 
information, which also measures classical correlations, and in general is non-monogamous, 
excepting special situations as the one we are focusing here. 




I(A,B,C) bc(-8a + 3(b + c))\og(d) 



(53) 
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Figure 7: Relative extensivity of the mutual information for the configuration of figure (JS])b. 
The interval sizes are a = b = c = 1. 

In order to have I(B, C) < I (A, B), I (A, C) we need in general that dis(A, B) ~ dis(A, C) < 
dis(-B, C), a deeply Lorenzian regime. The figure (jSJa shows one such a configuration. We again 
have three intervals A, B, C parallel to the x l axes but now there is a high relative boost be- 
tween A and B, and A and C, with opposite signs. Specifically, we write the interval between 
the right-most point in C and the left-most point in A as d(— sinh(a), cosh(a)), and the interval 
between the right-most point in A and the left-most point in B to be d(sinh(a), cosh(a)). We 
are interested in the behavior of the non-extensivity as a function of a. The typical distances are 
dis(A, B) = dis(A, C) ~ d, and dis(5, C) ~ e a ^ 2 d for large a. An example with a = b = c = 1 
and several values of d is shown in figure ([6]). The relevant entropy contributions to I (A, B, C), 
computed by eqs. ()40i) and ( 1421) are given by integrals in one variable. We evaluate them 
numerically. We see the relative tripartite information starts negative for small a. Then, with 
increasing boost it changes sign and peaks with a positive value. For large boost parameter 
a it decreases asymptotically like 1/a. Hence, our expectations are confirmed in this simple 
example where the mutual information is extensive in the limit of large relative boosts. 

It is interesting to note that well in the large boost situation the tripartite information is pos- 
itive, violating the strict monogamy relation I(A, B, C) < 0. This is good, since when this rela- 
tion holds we necessarily have extensivity in the limit of large a because < I(A, B, C) /I (A, B) < 
I(B,C)/I(A, B) 0. Note also that the case of fixed a and increasing d does not lead to 
I(A, B, C)/I(A, B) — > 0, similarly to what happens for the collinear case (j55|) . This is expected 
in accordance with the fact that the distances dis(A,B), dis(yl,C), and dis(B,C) all increase 
proportional to d in this case, and there are no large violations of the triangle inequalities. 

However, a connection between triangle inequalities and extensivity of information is not the 
whole story, and a further consideration of the relative boosts between the regions is necessary. 
Consider the geometry of figure (jS])b. In this case even if the configuration enhances the 
correlations of A with B and C over the one between B and C, by approaching A and B to 
a common null line, this is done in a non fully satisfactory way. We again take A, B and C 
parallel to the x 1 axis, the distance between A and B equal to d, and the interval separating 
the right-most point of B to the left-most point in A to be d(sinh(a), cosh(a)). The typical 
distances are dis(A, B) ~ d, dis(v4, C) E (0, a + c) and dis(5, C) ~ e a / 2 \/2ad for large a. The 
figure (j7|) shows an example with a = b = c = 1 and several values of the separating distance 
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d. The quantity of interest is the relative importance of the non-extensivity, measured by 
I(A, B, C) I (I (A, B)m 2 log(ra)). The mutual information I(A, B) is dominated by the massless 
contribution, which in the large a limit behaves linearly I(A, B) ~ a. However, the tripartite 
information is linear in the boost factor exp(a), as shown in figure (J7|). Therefore the relative 
non-extensivity grows rapidly with the boost. 

Therefore, in this case, even if the triangle inequalities are largely violated we still do not 
have extensivity. The reason can be traced back to the fact that here large dis(S, C) does 
not mean small I(B,C). This is the same phenomenon we discussed in the last section. The 
distance between B and C increases as ea but the relative boost of B and C with respect to 
the separating interval is also proportional to this factor. In fact, the massless contribution 
gives a non vanishing I(B, C) in the limit of large a. In order to have asymptotic extensivity 
a bound on the relative boosts seems to be necessary. 



5 Concluding remarks 

We have obtained the entanglement entropy for a free fermion in two dimensions for a completely 
general relativistic region. This has been done in a small mass approximation. 

We found a large amplification of shared information by boosts. This gives a non zero mutual 
information between two sets of arbitrary small size at a fixed minimum distance between each 
other, provided the relative boost diverges in the limit of small size. A partial understanding 
of this behavior in terms of correlators was provided. In more dimensions the analog effect is 
a non zero mutual information between null surface regions. 

This effect can also be understood in terms of the operator product expansion (OPE) of 
certain operators. In two dimensions the traces of integer powers of the reduced density matrix, 
tr(py) = e - ^ -1 ) 5 "^, where S n (V) is the Renyi entropy of index n corresponding to V, can be 
written in terms of operator correlators [251 126]. We have 

e -(n-l)Sn(lurx,..,h,r k ) = (Ol^O^n).. J n (Z fc )$„(r fc ) |0> , (55) 

where V = (Ji, r 1; 4, r^) contains k intervals, and $ n (x) is the CPT conjugate of the field 
&n(— In the Euclidean formulation the fields $ n ( X^j £1X6 £L special class of twisting operators 
[26J. The entropy is a limit of the Renyi entropies for index n — > 1, and can be obtained 
by analytic continuation in the variable n. Then, it is expected (and often found) that the 
Renyi entropies and the entropy have a similar behavior as functions of the coordinates. In this 
representation the limit of small size of the j th interval corresponds to the limit of the correlator 
in which there is a product $ n (Zj-)<fr n (rj) for lj — > rj. In this limit we are able to use the OPE 

m 

~- 7i — 1 / n 

$ n (-x/2)$ n (z/2) ~ c\x\ b- 1 , (56) 

which is proportional to the identity operator to leading order. Hence, the limit of the Renyi 
mutual information I n (A, B) = S n (A) + S n (B) — S n (AB) for sets containing an interval with 
vanishing small length emerges naturally from (15611 . This reads simply 



and 



hm I n ((l A , r% (if, rf , .... Jf, rf )) = , (57) 

l A — >r A 



In{{l\i r ii •••) if -i fft -i ifi T k)i {ill T li ■1 iqi r q)) 
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where in the right hand side the marked points on the Renyi mutual information argument are 
absent, and thus the first set has only k — 1 intervals. That is, the vanishing small size interval 
just disappear from the argument of the mutual information. 

However, if we take the limit l A — > r A by approaching a null line (while leaving all other 
points fixed) the OPE has a very different content. This is because boost and distance are 
scaled proportionally to each other in this limit, and the series involves an infinite number of 
operators of increasing dimension and spin even at leading order [27] • The impossibility of 
replacing the product of the two approaching operators by a finite combination of operators at 
some given point is clearly seen for example in figure fll]). Even if the two end points I a and 
va of A are at small distance from each other, their distances dis(/^,x) and dis(r j 4,x) from a 
point x G B are very different. 

Based on the lessons learned so far from this model, we are tempted to conjecture that the 
mutual information is extensive, I(A, B, C)/I(A, B) — y 0, in the limit of large violation of the 
triangle inequalities dis(A,B) ~ dis(A,C) <C dis(B,C), and when the relative boosts involved 
in the configuration of BC are kept bounded. This is the appropriate geometric scenario for 
analyzing the information shared by a black hole with the radiation region. Indeed, we are also 
tempted to suggest that this may be a general effect, leading to information extensivity in the 
Hawking radiation. 

A proof of this conjecture in Minkowski space may involve the analysis of the clustering 
properties of the operators in this particular geometrical limit, and a more complete understand- 
ing of the relation between the mutual information and the correlators than the one provided 
by (I48p . This configuration corresponds to the OPE in a limit involving several momentum 
becoming large simultaneously. This is reminiscent to [28], where the information content of 
Hawking radiation was linked to the understanding of virtual particle collisions in the limit of 
very large centre of mass energies. 



Appendix: Spectral resolution of the kernel (x — y) 



-l 



In this appendix we display the spectral decomposition of the kernel (x — y)~ l (with principal 
value regularization) in a generic multi-interval set V — (a 1; bi) U ... U (a„, b n ) C R, and collect 
the properties of the eigenvectors, which are relevant to the calculations in the main text. Most 
of these properties appear in [T2] (see also [T9~])- 
The eigenvectors ^> k , 

f dy - J— = A s **(z) , k = 1, .., n, (59) 

Jv x ~v 

corresponding to the eigenvalue 

X s = i7rtanh("7rs) G (— in, in) (60) 

for any s G R, are n-fold degenerate, where n is the number of intervals. An explicit expression 
for an ortho normal basis of eigenvectors ^(x) reads 

J-is+l/2)z(x) 

= Vt — r ' ( 61 ) 

N k (x - a k ) 



17 



where 



z{x) = iog (-fefcS) ■ t (62) 

Nt ~ ^{h fo-«diw^-fc>~n,-(<*-*/>J ' (63) 

The eigenvectors satisfy the orthonormality relations, 



fc=i 

Besides we have the integrals 



/ dx^*(x)^(x) = S kM S(s-s') 
Jv 

" POO 

/ da*?(x)* k a (v) = 6(x-y). 



(64) 
(65) 



/ dxV*{x) = (-l) n+1 vrsech(vrs)A^- 1 , (66) 

Im f dxx^ k {x) = (-l) n svrsech(7rs)iV fc 7 1 L, (67) 
Jv 



where L = — a «) * s the sum °f the 71 interval lengths. 

We also employ the useful algebraic identities 



i=l 



n 1 

X^On-a*)" 1 = ^(l + e" z ). (69) 

fc=l 
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